Toward Multi-modal Music Emotion Classification
Authors
Abstract
Categorical music emotion classification, which divides emotion into discrete classes, has reached a performance limit when it relies on audio features alone, because a semantic gap separates the object feature level from the human cognitive level of emotion perception. Motivated by the fact that lyrics carry rich semantic information about a song, we propose a multi-modal approach to improve categorical music emotion classification. By exploiting both the audio features and the lyrics of a song, the proposed approach improves 4-class emotion classification accuracy from 46.6% to 57.1%. The results also show that incorporating lyrics significantly enhances the classification accuracy of valence.
Similar resources
Music Emotion Regression based on Multi-modal Features
Music emotion regression is considered more appropriate than classification for music emotion retrieval, since it resolves some of the ambiguities of emotion classes. In this paper, we propose an AdaBoost-based approach for music emotion regression, in which emotion is represented in the PAD model and multi-modal features are employed, including audio, MIDI and lyric features. We first demonstrate ...
Boosting for Multi-Modal Music Emotion Classification
With the explosive growth of music recordings, automatic classification of music emotion has become a hot topic in research and engineering. Typical music emotion classification (MEC) approaches apply machine learning methods to train a classifier based on audio features. In addition to audio features, the MIDI and lyrics features of music also contain useful semantic information for pred...
An Audio-Visual Approach to Music Genre Classification through Affective Color Features
This paper presents a study on classifying music by affective visual information extracted from music videos. The proposed audio-visual approach analyzes genre specific utilization of color. A comprehensive set of color specific image processing features used for affect and emotion recognition derived from psychological experiments or art-theory is evaluated in the visual and multi-modal domain...
Multi-label classification of music by emotion
This work studies the task of automatic emotion detection in music. Music may evoke more than one different emotion at the same time. Single-label classification and regression cannot model this multiplicity. Therefore, this work focuses on multi-label classification approaches, where a piece of music may simultaneously belong to more than one class. Seven algorithms are experimentally compared...
The Color of Music: Synesthesia or emotion-mediated cross-modal associations?
The cross-modal literature posits a weak-to-strong continuum of synesthesia. One extreme views cross-modal associations as idiosyncratic and unique to synesthetes. The other extreme suggests that cross-modal associations follow a general pattern across individuals, and are mediated by emotional associations. We tested these views by examining differences between music-color synesthetes and non-...